Stabilizing variable selection and regression
نویسندگان
چکیده
We consider regression in which one predicts a response Y with set of predictors X across different experiments or environments. This is common setup many data-driven scientific fields, and we argue that statistical inference can benefit from an analysis takes into account the distributional changes In particular, it useful to distinguish between stable unstable predictors, is, have fixed changing functional dependence on response, respectively. introduce stabilized explicitly enforces stability thus improves generalization performance previously unseen Our work motivated by application systems biology. Using multiomic data, demonstrate how hypothesis generation about gene function regression. believe similar line arguments for exploiting heterogeneity data be powerful other applications as well. draw theoretical connection multi-environment causal models allows graphically characterize vs. response. Formally, notion blanket subset lies direct Markov blanket. prove this optimal sense based these minimizes mean squared prediction error, given resulting generalizes new
منابع مشابه
Variable Selection in ROC Regression
Regression models are introduced into the receiver operating characteristic (ROC) analysis to accommodate effects of covariates, such as genes. If many covariates are available, the variable selection issue arises. The traditional induced methodology separately models outcomes of diseased and nondiseased groups; thus, separate application of variable selections to two models will bring barriers...
متن کاملVariable Selection for Regression Models
A simple method for subset selection of independent variables in regression models is proposed. We expand the usual regression equation to an equation that incorporates all possible subsets of predictors by adding indicator variables as parameters. The vector of indicator variables dictates which predictors to include. Several choices of priors can be employed for the unknown regression coeecie...
متن کاملVariable Selection in Quantile Regression
After its inception in Koenker and Bassett (1978), quantile regression has become an important and widely used technique to study the whole conditional distribution of a response variable and grown into an important tool of applied statistics over the last three decades. In this work, we focus on the variable selection aspect of penalized quantile regression. Under some mild conditions, we demo...
متن کاملVariable Selection in Semiparametric Regression Modeling By
In this paper, we are concerned with how to select significant variables in semiparametric modeling. Variable selection for semiparametric regression models consists of two components: model selection for nonparametric components and selection of significant variables for the parametric portion. Thus, semiparametric variable selection is much more challenging than parametric variable selection ...
متن کاملVariable Selection for Multivariate Logistic Regression Models
In this paper, we use multivariate logistic regression models to incorporate correlation among binary response data. Our objective is to develop a variable subset selection procedure to identify important covariates in predicting correlated binary responses using a Bayesian approach. In order to incorporate available prior information, we propose a class of informative prior distributions on th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Annals of Applied Statistics
سال: 2021
ISSN: ['1941-7330', '1932-6157']
DOI: https://doi.org/10.1214/21-aoas1487